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known  <m  free  indexation’  plays  an  important  part  in  the  process  of  determining 
the  referential  properties  of  elements  such  as  anaphora  and  pronominals.  This 
paper  addresses  two  issues.  (1)  We  investigate  the  combinatorics  of  free  index¬ 
ation.  By  relating  the  problem  to  the  n-set  partitioning  problem,  we  show  that 
free  indexation  must  produce  an  exponential  number  of  referentially  distinct 
phrase  structures  given  a  structure  with  n  (independent)  noun  phrases.  (2) 
We  introduce  an  algorithm  for  free  indexation  that  is  defined  compositionally 
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on  phrase  structures.  We  show  how  the  compositional  nature  of  the  algorithm 
makes  it  possible  to  incrementally  interleave  the  computation  of  free  indexa¬ 
tion  with  phrase  structure  construction.  Additionally,  we  prove  the  algorithm 
to  be  an  ‘optimal’  procedure  for  free  indexation.  More  precisely,  by  relating 
the  compositional  structure  of  the  formulation  to  the  combinatorial  analysis, 
we  show  that  the  algorithm  enumerates  precisely  all  possible  indexings,  without 
duplicates. 
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1  Free  Indexation 

Consider  the  ambiguous  sentence: 


(1)  John  believes  Bill  will  identify  him 


In  (1),  the  pronominal  “him”  can  be  interpreted  as  being  coreferential  with 
“John”,  or  with  some  other  person  not  named  in  (1),  but  not  with  “Bill”.  We 
can  represent  these  various  cases  by  assigning  indices  to  all  noun  phrases  in  a 
sentence  together  with  the  interpretation  that  two  noun  phrases  are  coreferential 
if  and  only  if  they  are  coindexed,  that  is,  if  they  have  the  same  index.  Hence 
the  following  indexings  represent  the  three  coreference  options  for  pronominal 
“him”:1 

(2)  a.  Johni  believes  Bill2  will  identify  himi 

b.  Johni  believes  Bill2  will  identify  him3 

c.  *Johni  believes  Bill2  will  identify  him2 


In  the  principles-and-parameters  framework  (Chomsky  [3]),  once  indices 
have  been  assigned,  general  principles  that  state  constraints  on  the  locality 
of  reference  of  pronominals  and  names  (e.g.  “John”  and  “Bill”)  will  conspire 
to  rule  out  the  impossible  interpretation  (2c)  while,  at  the  same  time,  allow 
the  other  two  (valid)  interpretations.  The  process  of  assigning  indices  to  noun 
phrases  is  known  as  “free  indexation,”  which  has  the  following  general  form: 

(4)  Assign  indices  freely  to  all  noun  phrases.2 

In  such  theories,  free  indexation  accounts  for  the  fact  that  we  have  coreferential 
ambiguities  in  language.  Other  principles  interact  so  as  to  limit  the  number 
of  indexings  generated  by  free  indexation  to  those  that  are  semantically  well- 
formed. 

'Note  that  the  indexing  mechanism  used  above  is  too  simplistic  a  framework  to  handle 
binding  examples  involving  inclusion  of  reference  such  as: 

(3)  a.  Wei  think  that  Ii  will  win 

b.  Wei  think  that  lj  will  win 

c.  *We,  like  myself, 

d.  John  told  Bill  that  they  should  leave 


Richer  schemes  that  address  some  of  these  problems,  for  example,  by  representing  indices  as 
sets  of  numbers,  have  been  proposed.  See  Lasnik  [9]  for  a  discussion  on  the  limitations  of,  and 
alternatives  to,  simple  indexation.  Also,  Higginbotham  [7]  has  argued  against  coindexation 
(a  symmetric  relation),  and  in  favour  of  directed  links  between  elements  (linking  theory).  In 
general,  there  will  be  twice  as  many  possible  ‘linkings'  as  indexings  for  a  given  structure. 
However,  note  that  the  asymptotic  results  of  Section  2  obtained  for  free  indexation  will  also 
hold  for  linking  theory. 

JThe  exact  form  of  (4)  varies  according  to  different  versions  of  the  theory.  For  example, 
in  Chomsky  (4)  (pg.59),  free  indexation  is  restricted  to  apply  to  A-positions  at  the  level  of 
S-structure,  and  to  A-positions  at  the  level  of  logical  form. 
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In  theory,  since  the  indices  are  drawn  from  the  set  of  natural  numbers,  there 
exists  an  infinite  number  of  possible  indexings  for  any  sentence.  However,  we 
are  only  interested  in  those  indexings  that  are  distinct  with  respect  to  semantic 
interpretation.  Since  the  interpretation  of  indices  is  concerned  only  with  the 
equality  (and  inequality)  of  indices,  there  are  only  a  finite  number  of  semanti¬ 
cally  different  indexings.3  For  example,  “Johnj  likes  Mary2”  and  “John23  likes 
Mary4”  are  considered  to  be  equivalent  indexings.  Note  that  the  definition  in  (4) 
implies  that  “John  believes  Bill  will  identify  him”  has  two  other  indexings  (in 
addition  to  those  in  (2)): 

(5)  a.  *Johni  believes  Billi  will  identify  himi 
b.  *Johni  believes  Billi  will  identify  him2 


In  some  versions  of  the  theory,  indices  are  only  freely  assigned  to  those  noun 
phrases  that  have  not  been  coindexed  through  a  rule  of  movement  (Move-a). 
(see  Chomsky  [3]  (pg.331)).  For  example,  in  “Whoi  did  John  see  [jvpt]i?”,  the 
rule  of  movement  effectively  stipulates  that  “Who”  and  its  trace  noun  phrase 
must  be  coreferential.  In  particular,  this  implies  that  free  indexation  must  not 
assign  different  indices  to  “who”  and  its  trace  element.  For  the  purposes  of  free 
indexation,  we  can  essentially  ‘collapse’  these  two  noun  phrases,  and  treat  them 
as  if  they  were  only  one.  Hence,  this  structure  contains  only  two  independent 
noun  phrases.4 


2  The  Combinatorics  of  Free  Indexation 

In  this  section,  we  show  that  free  indexation  generates  an  exponential  number 
of  indexings  in  the  number  of  independent  noun  phrases  in  a  phrase  structure. 
We  achieve  this  result  by  observing  that  the  problem  of  free  indexation  can  be 
expressed  in  terms  of  a  well-known  combinatorial  partitioning  problem. 

Consider  the  general  problem  of  partitioning  a  set  of  n  elements  into  m  non¬ 
empty  (disjoint)  subsets.  For  example,  a  set  of  four  elements  {w,x,  y,  z}  can  be 
partitioned  into  two  subsets  in  the  following  seven  ways: 

{iu,x,y}{z}  {u>,  x){y,  z} 

{w,x,z){y}  {u;,y}{x,z} 

{iu,y,z}{x}  {u;,z}{x,y} 

{x,y,z}{u;} 

3 In  other  words,  there  are  only  a  finite  number  of  equivalence  classes  on  the  relation  ‘same 
coreference  relations  hold. '  This  can  easily  be  shown  by  induction  on  the  number  of  indexed 
elements. 

‘Technically,  “who”  and  its  trace  are  said  to  form  a  chain.  Hence,  the  structure  in  question 
contains  two  distinct  chains. 
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The  number  of  partitions  obtained  thus  is  usually  represented  using  the  nota¬ 
tion  {^}  (Knuth  [8]).  In  general,  the  number  of  ways  of  partitioning  n  elements 
into  m  sets  is  given  by  the  following  formula.  (See  Purdom  k  Brown  [10]  for  a 
discussion  of  (6).) 

(6) 

{m+l}  =  {m}+(,"+l){m“  l}’ 

The  number  of  ways  of  partitioning  n  elements  into  zero  sets,  {  £  } ,  is  defined 
to  be  zero  for  n  >  0  and  one  when  n  —  0.  Similarly,  {^} ,  the  number  of  ways 
of  partitioning  zero  elements  into  m  sets  is  zero  for  m  >  0  and  one  when  m  =  0. 

We  observe  that  the  problem  of  free  indexation  may  be  expressed  as  the 
problem  of  assigning  1 , 2, . . . ,  n  distinct  indices  to  n  noun  phrases  where  n  is 
the  number  of  noun  phrases  in  a  sentence.  Now,  the  general  problem  of  assigning 
m  distinct  indices  to  n  noun  phrases  is  isomorphic  to  the  problem  of  partitioning 
n  elements  into  m  non-empty  disjoint  subsets.  The  correspondence  here  is  that 
each  partitioned  subset  represents  a  set  of  noun  phrases  with  the  same  index. 
Hence,  the  number  of  indexings  for  a  sentence  with  n  noun  phrases  is: 

(7) 


(The  quantity  in  (7)  is  commonly  known  as  Bell’s  Exponential  Number 
Bn\  see  Berge  [2].)  The  recurrence  relation  in  (6)  has  the  following  solution 
(Abramowitz  [1]): 

(8) 

Using  (8),  we  can  obtain  a  finite  summation  form  for  the  number  of  index¬ 
ings: 


(9) 


n  m 


m  =  1  *=0 


(zlT—  j f 

(m  —  jfe)!  jfc! 


It  can  also  be  shown  (Graham  [6])  that  B„  is  asymptotically  equal  to  (10): 


(10) 


mnnem'- * 
\/ln  n 


1 


3 


where  the  quantity  mn  is  given  by: 


(U)  I 

mn  In  m„  =  n  -  - 

That  is,  (10)  is  both  an  upper  and  lower  bound  on  the  number  of  indexings. 
More  concretely,  to  provide  some  idea  of  how  fast  the  number  of  possible  in¬ 
dexings  increases  with  the  number  of  noun  phrases  in  a  phrase  structure,  the 
following  table  exhibits  the  values  of  (9)  for  the  first  dozen  values  of  n: 


NPs 

Indexings 

NPs 

Indexings 

1 

1 

7 

877 

2 

2 

8 

4140 

3 

5 

9 

21147 

4 

15 

10 

115975 

5 

52 

11 

678570 

6 

203 

12 

4123597 

3  A  Compositional  Algorithm 

In  this  section,  we  will  define  a  compositional  algorithm  for  free  indexation  that 
provably  enumerates  all  and  only  all  the  possible  indexings  predicted  by  the 
analysis  of  the  previous  section. 

The  po-parser  is  a  parser  based  on  a  principles-and-parameters  framework 
with  a  uniquely  flexible  architecture  ([5]).  In  this  parser,  linguistic  principles 
such  as  free  indexation  may  be  applied  either  incrementally  as  bottom-up  phrase 
structure  construction  proceeds,  or  as  a  separate  operation  after  the  complete 
phrase  structure  for  a  sentence  is  recovered.  The  po-parser  was  designed 
primarily  as  a  tool  for  exploring  how  to  organize  linguistic  principles  for  efficient 
processing.  This  freedom  in  principle  application  allows  one  to  experiment  with 
a  wide  variety  of  parser  configurations. 

Perhaps  the  most  obvious  algorithm  for  free  indexation  is,  first,  to  simply 
collect  all  noun  phrases  occurring  in  a  sentence  into  a  list.  Then,  it  is  easy  to 
obtain  all  the  possible  indexing  combinations  by  taking  each  element  in  the  list 
in  turn,  and  optionally  coindexing  it  with  each  element  following  it  in  the  list. 
This  simple  scheme  produces  each  possible  indexing  without  any  duplicates  and 
works  well  in  the  case  where  free  indexing  applies  after  structure  building  has 
been  completed. 

The  problem  with  the  above  scheme  is  that  it  is  not  flexible  enough  to  deal 
with  the  case  when  free  indexing  is  to  be  interleaved  with  phrase  structure 
construction.  Conceivably,  one  could  repeatedly  apply  the  algorithm  to  avoid 
missing  possible  indexings.  However,  this  is  very  inefficient,  that  is,  it  involves 
much  duplication  of  effort.  Moreover,  it  may  be  necessary  to  introduce  extra 
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machinery  to  keep  track  of  each  assignment  of  indices  in  order  to  avoid  the 
problem  of  producing  duplicate  indexings.  Another  alternative  is  to  simply  delay 
the  operation  until  all  noun  phrases  in  the  sentence  have  been  parsed.  (This  is 
basically  the  same  arrangement  as  in  the  non-interleaved  case.)  Unfortunately, 
this  effectively  blocks  the  interleaved  application  of  other  principles  that  are 
logically  dependent  on  free  indexation  to  assign  indices.  For  example,  this  means 
that  principles  that  deed  with  locality  restrictions  on  the  binding  of  anaphors 
and  pronominals  cannot  be  interleaved  with  structure  building  (despite  the  fact 
that  these  particular  parser  operations  can  be  effectively  interleaved). 

An  algorithm  for  free  indexation  that  is  defined  compositionally  on  phrase 
structures  can  be  effectively  interleaved.  That  is,  free  indexing  should  be  de¬ 
fined  so  that  the  indexings  for  a  phrase  is  some  function  of  the  indexings  of 
its  sub-constituents.  Then,  coindexings  can  be  computed  incrementally  for  all 
individual  phrases  as  they  are  built.  Of  course,  a  compositional  algorithm  can 
also  be  used  in  the  non-interleaved  case. 

Basically,  the  algorithm  works  by  maintaining  a  set  of  indices  at  each  sub- 
phrase  of  a  parse  tree.5  Each  index  set  for  a  phrase  represents  the  range  of 
indices  present  in  that  phrase.  For  example,  “Who*  did  John,  see  <,?”  has  the 
following  phrase  structure  and  index  sets: 

(12)  [< cp  [/vp  Who/]  [g  did  [/p  [jvp  John,]  [Vp  see  [NP  <,]]]]] 

(O')  {<}  {*,;}  {<J>  {;}  {.}  {<} 

There  are  two  separate  tasks  to  be  performed  whenever  two  (or  more)  phrases 
combine  to  form  a  larger  phrase.6 * 8  First,  we  must  account  for  the  possibility  that 
elements  in  one  phrase  could  be  coindexed  (cross-indexed)  with  elements  from 
the  other  phrase.  This  is  accomplished  by  allowing  indices  from  one  set  to 
be  (optionally)  merged  with  distinct  indices  from  the  other  set.  For  example, 
the  phrases  “[/vpJohnj]”  and  “[vp  likes  him,]”  have  index  sets  {i}  and  {j}, 
respectively.  Free  indexation  must  allow  for  the  possibilities  that  “John”  and 
“him”  could  be  coindexed  or  maintain  distinct  indices.  Cross-indexing  accounts 
for  this  by  optionally  merging  indices  i  and  j.  Hence,  we  obtain: 

(13)  a.  John,  likes  him,  ,  i  merged  with  j 

b.  John,  likes  him,-,  i  not  merged  with  j 


5  For  expository  reasons,  we  consider  only  pure  indices.  The  actual  algorithm  keeps  track  of 
additional  information,  such  as  agreement  features  like  person,  number  and  gender,  associated 

with  each  index.  For  example,  irrespective  of  configuration,  “Mary"  and  “him"  can  never  have 

the  same  index. 

8Some  eaders  may  realize  that  the  algorithm  must  have  an  additional  step  in  cases  where 
the  larger  phrase  itself  may  be  indexed,  for  instance,  as  in  [atpJ/vp,  John's  )  mother].  In  such 
cases,  the  third  step  is  simply  to  merge  the  singleton  set  consisting  of  the  index  of  the  larger 
phrase  with  the  result  of  cross- indexing  in  the  first  step.  (For  the  above  example,  the  extra 
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Secondly,  we  must  find  the  index  set  of  the  aggregate  phrase.  This  is  just 
the  set  union  of  the  index  sets  of  its  sub-phrases  after  cross-indexation.  In  the 
example,  “John  likes  him”,  (13a)  and  (13b)  have  index  sets  {i}  and  {»,  j). 

More  precisely,  let  Ip  be  the  set  of  all  indices  associated  with  the  Binding 
Theory-relevant  elements  in  phrase  P.  Assume,  without  loss  of  generality,  that 
phrase  structures  are  binary  branching.  Consider  a  phrase  P  =  [p  X  Y]  with 
immediate  constituents  X  and  y.  Then: 

1.  Cross  Indexing:  Let  lx  represent  those  elements  of  lx  which  are  not  also 
members  of  2y,  that  is,  (Ix  -  Iy)-  Similarly,  let  /y  be  (7y  —  lx)-7 

(a)  If  both  lx  and  /y  are  empty  sets,  then  done. 

(b)  Let  x  and  y  be  members  of  lx  and  7y ,  respectively. 

(c)  Either  merge  indices  x  and  y  or  do  nothing. 

(d)  Repeat  from  step  (la)  substituting  lx  -  {*}  and  7y  —  {y}  for  lx  and 
/y,  respectively,  if  x  and  y  have  been  merged. 

2.  Index  Set  Propagation:  Ip  =  Ix  U  Iy . 

The  nondeterminism  in  step  (lc)  of  cross-indexing  will  generate  all  and  only 
all  (i.e.  without  duplicates)  the  possible  indexings.  We  will  show  this  in  two 
parts.  First,  we  will  argue  that  the  above  algorithm  cannot  generate  duplicate 
indexings.  That  is,  the  algorithm  only  generates  distinct  indexings  with  respect 
to  the  interpretation  of  indices.  As  shown  in  the  previous  section,  the  com¬ 
binatorics  of  free-indexing  indicates  that  there  are  only  Bn  possible  indexings. 
Next,  we  will  demonstrate  that  the  algorithm  generates  exactly  that  number 
of  indexings.  If  the  algorithm  satisfies  both  of  these  conditions,  then  we  have 
proved  that  it  generates  all  the  possible  indexings  exactly  once. 

1.  Consider  the  definition  of  cross-indexing.  Ix  represents  those  indices  in  X 
that  do  not  appear  in  Y.  (Similarly  for  Iy.)  Also,  whenever  two  indices 
are  merged  in  step  (lb),  they  are  ‘removed’  from  Ix  and  IY  before  the 
next  iteration.  Thus,  in  each  iteration,  x  and  y  from  step  (lb)  are  ‘new’ 
indices  that  have  not  been  merged  with  each  other  in  a  previous  iteration. 
By  induction  on  tree  structures,  it  is  easy  to  see  that  two  distinct  indices 
cannot  be  merged  with  each  other  more  than  once.  Hence,  the  algorithm 
cannot  generate  duplicate  indexings. 

2.  We  now  demonstrate  why  the  algorithm  generates  exactly  the  correct  num¬ 
ber  of  indexings  by  means  of  a  simple  example.  Without  loss  of  generality, 
consider  the  following  right-branching  phrase  scheme: 

step  is  to  just  merge  {t}  with  { j} . )  For  expository  reasons,  we  will  ignore  such  cases.  Note 
that  no  loss  of  generality  is  implied  since  a  structure  of  the  form  [atp,  (y />, ...  a  ...]...  0 . . .] 
can  be  can  always  be  handled  as  [/>,  [jvp,][pjljvpj . . .  a  0 . . .]]. 

7Note  that  Ix  and  Iy  are  defined  purely  for  notational  convenience.  That  is,  the  algorithm 
directly  operates  on  the  elements  of  I\  and  Iy. 
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Now  consider  the  following  decision  tree  for  computing  the  possible  index¬ 
ings  of  the  right-branching  tree  in  a  bottom-up  fashion: 


NPa 


Decision  Tree 


N  Pi 


NPj 


NPt 


{«■} 


Each  node  in  the  tree  represents  the  index  set  of  the  combined  phrase 
depending  on  whether  the  noun  phrase  at  the  same  level  is  cross-indexed 
or  not.  For  example,  {i}  and  {»',  j}  on  the  level  corresponding  to  NPj  are 
the  two  possible  index  sets  for  the  phrase  P,j.  The  path  from  the  root  to 
an  index  set  contains  arcs  indicating  what  choices  (either  to  coindex  or  to 
leave  free)  must  have  been  made  in  order  to  build  that  index  set.  Next, 
let  us  just  consider  the  cardinality  of  the  index  sets  in  the  decision  tree, 
and  expand  the  tree  one  more  level  (for  N  Pi): 
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1 


1  2 


k 

1  2  2  2  3 


Informally  speaking,  observe  that  each  decision  tree  node  of  cardinality  i 
‘generates’  i  child  nodes  of  cardinality  i  plus  one  child  node  of  cardinality 
i  +  1.  Thus,  at  any  given  level,  if  the  number  of  nodes  of  cardinality  m 
is  cm,  and  the  number  of  nodes  of  cardinality  m  —  1  is  cm_i,  then  at  the 
next  level  down,  there  will  be  mcm  +  em_i  nodes  of  cardinality  m.  Let 
c(n,  m)  denote  the  number  of  nodes  at  level  n  with  cardinality  m.  Let  the 
top  level  of  the  decision  tree  be  level  1.  Then: 

(14) 


c(n  -f  1,  m  -f  1)  =  c(n,  m)  +  (m  +  l)c(n,  m  +  1) 

Observe  that  this  recurrence  relation  has  the  same  form  as  equation  (6). 
Hence  the  algorithm  generates  exactly  the  same  number  of  indexings  as 
demanded  by  combinatorial  analysis. 


4  Conclusions 

This  paper  has  shown  that  free  indexation  produces  an  exponential  number  of 
indexings  per  phrase  structure.  This  implies  that  all  algorithms  that  compute 
free  indexation,  that  is,  assign  indices,  must  also  take  at  least  exponential  time. 
In  this  section,  we  will  discuss  whether  it  is  possible  for  a  principle-based  parser 
to  avoid  the  combinatorial  ‘blow-up’  predicted  by  analysis. 

First,  let  us  consider  the  question  whether  the  ‘full  power’  of  the  free  index¬ 
ing  mechanism  is  necessary  for  natural  languages.  Alternatively,  would  it  be 
possible  to  ‘shortcut’  the  enumeration  procedure,  that  is,  to  get  away  with  pro¬ 
ducing  fewer  than  Bn  indexings?  After  all,  it  is  not  obvious  that  a  sentence  with 
a  valid  interpretation  can  be  constructed  for  every  possible  indexing.  However, 
it  turns  out  (at  least  for  small  values  of  n;  see  examples  (15)  and  (16)  below) 

TTo  make  the  boundary  case*  match,  just  define  c(0, 0)  to  be  1,  and  let  c(0,  m)  =  0  and 
c(n,0)  s  0  for  m  >  0  and  n  >  0,  respectively. 
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that  language  makes  use  of  every  combination  predicted  by  analysis.  This  im¬ 
plies,  that  all  parsers  must  be  capable  of  producing  every  indexing,  or  else  miss 
valid  interpretations  for  some  sentences. 

There  are  B3  =  5  possible  indexings  for  three  noun  phrases:8 


(15)  a.  Johni  wanted  PRO\  to  forgive  himself]  (111) 

b.  John]  wanted  PRO\  to  forgive  him?  (112) 

c.  John]  wanted  Ma/y2  to  forgive  himj  (121) 

d.  John]  wanted  Mary2  to  forgive  herself?  (122) 

e.  John]  wanted  Ma ty2  to  forgive  him3  (123) 


Similarly,  there  are  fifteen  possible  indexings  for  four  noun  phrases: 


(16)  a.  Johnj  persuaded  himself]  that  hei  should  give  himself]  up  (1111) 

b.  Johnj  persuaded  Mary2  PRO*  to  forgive  herself?  (1222) 

c.  Johnj  persuaded  himself]  PRO\  to  forgive  her?  (U12) 

d.  Johnj  persuaded  Mary.,  PROi  to  forgive  himi  (1221) 

e.  John,  persuaded  Mary2  PRO 2  to  forgive  him3  (1223) 

f.  John]  wanted  Bills  to  ask  Mary3  PRO3  to  leave  (1233) 

g.  John]  wanted  PRO\  to  tell  Mary2  about  herself?  (1122) 

h.  John,  wanted  Mary2  to  tell  himi  about  himself]  (1211) 

i.  Johnj  wanted  PRO\  to  tell  Mary2  about  himself]  (H21) 

j.  John,  wanted  Bill?  to  tell  Mary3  about  himself?  (1232) 

k.  Johnj  wanted  PRO\  to  tell  Mary2  about  Toms  (1123) 

l.  John,  wanted  Mary2  to  tell  him,  about  Tom3  (1213) 

m.  John,  wanted  Mary2  to  tell  Tom?  about  him,  (1231) 

n.  John,  wanted  Mary2  to  tell  Toms  about  BilU  (1234) 


Although  it  may  be  the  case  that  a  parser  must  be  capable  of  producing 
every  possible  indexing,  it  does  not  necessarily  follow  that  a  parser  must  enu¬ 
merate  every  indexing  when  parsing  a  particular  sentence.  In  fact,  for  many 
cases,  it  is  possible  to  avoid  exhaustively  exploring  the  search  space  of  possibil¬ 
ities  predicted  by  combinatorial  analysis.  To  do  this,  basically  we  must  know, 
a  priori,  what  classes  of  indexings  are  impossible  for  a  given  sentence.  By  fac¬ 
toring  in  knowledge  about  restrictions  on  the  locality  of  reference  of  the  items 
to  be  indexed  (i.e.  binding  principles),  it  is  possible  to  explore  the  space  of  in¬ 
dexings  in  a  controlled  fashion.  For  example,  although  free  indexation  implies 
that  there  are  five  indexings  for  “John  thought  [5  Tom  forgave  himself  ]  ”,  we 
can  make  use  of  the  fact  that  “himself’  must  be  coindexed  with  an  element 
within  the  subordinate  clause  to  avoid  generating  indexings  in  which  “Tom” 
and  “himself”  are  not  coindexed.9  Note  that  the  early  elimination  of  ill-formed 
indexings  depends  crucially  on  a  parser’s  ability  to  interleave  binding  principles 
with  structure  building.  But,  as  discussed  in  Section  3,  the  interleaving  of  bind¬ 
ing  principles  logically  depends  on  the  ability  to  interleave  free  indexation  with 

*  PRO  is  an  empty  (non-overt)  noun  phrase  element. 
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structure  building.  Hence  the  importance  of  an  formulation  of  free  indexation, 
such  as  the  one  introduced  in  Section  3,  which  can  be  effectively  interleaved. 
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that  “Tom"  cannot  be  coindexed  with  “John”,  we  can  pare  the  list  of  indexings  down  to  just 
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